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Abstract 

Compressive sensing (CS) technologies present many advantages over other existing approaches for 
implementing wideband spectrum sensing in cognitive radios (CRs), such as reduced sampling rate and 
computational complexity. However, there are two significant challenges: 1) choosing an appropriate 
number of sub-Nyquist measurements, and 2) deciding when to terminate the greedy recovery algorithm 
that reconstructs wideband spectrum. In this paper, an autonomous compressive spectrum sensing 
(ACSS) framework is presented that enables a CR to automatically choose the number of measurements 
while guaranteeing the wideband spectrum recovery with a small predictable recovery error. This 
is realized by the proposed measurement infrastructure and the validation technique. The proposed 
ACSS can find a good spectral estimate with high confidence by using only a small testing subset in 
both noiseless and noisy environments. Furthermore, a sparsity-aware spectral recovery algorithm is 
proposed to recover the wideband spectrum without requiring knowledge of the instantaneous spectral 
sparsity level. Such an algorithm bridges the gap between CS theory and practical spectrum sensing. 
Simulation results show that ACSS can not only recover the spectrum using an appropriate number 
of measurements, but can also considerably improve the spectral recovery performance compared with 
existing CS approaches. The proposed recovery algorithm can autonomously adopt a proper number of 
iterations, therefore solving the problems of under-fitting or over-fitting which commonly exist in most 
greedy recovery algorithms. 
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Index Terms 

Cognitive radio, Spectrum sensing, Compressive sensing, Sub-Nyquist sampling. 

I. Introduction 

The radio frequency (RF) spectrum is a finite natural resource, currently regulated by govern¬ 
ment agencies. According to current policy, primary user (PU) on a particular spectrum band 
has exclusive right to the licensed spectrum. With the explosive growth of wireless applications, 
the demands for RF spectrum are constantly increasing. On the other hand, it has been reported 
that localized temporal and geographic spectrum utilization efficiency is extremely low IfTll. Ell. 
Cognitive radio (CR) 0 has emerged as one of the most promising solutions that address the 
spectral under-utilization problem. A crucial requirement of CRs is that they must rapidly exploit 
spectrum holes (i.e., portions of the licensed spectrum that are not being used by PUs) without 
causing harmful interference to PUs. This task is achieved by spectrum sensing, which can be 
defined as a technique for achieving awareness about the spectral usage and existence of PUs 
in a given geographical area 0], Q. 

CR with a wide spectral awareness (e.g., a few GHz rather than MHz) could potentially exploit 
more spectral opportunities and achieve larger capacity. Wideband spectrum sensing techniques 
(categorized into Nyquist wideband sensing and sub-Nyquist wideband sensing) therefore have 
attracted considerable attention in research on CR networks (2l|. In 0, Tian and Giannakis 
proposed a wavelet based approach using Nyquist sampling rate for wideband spectrum sensing. 
Quan et al. 0, 01 presented a multiband joint detection (MJD) approach to detect the primary 
signal from Nyquist samples over multiple frequency bands. Note that according to the Nyquist 
sampling theory, the received signal at CR should be sampled at a sampling rate of at least twice 
the maximum signal frequency 0. Thus, to achieve a “wider” spectral awareness at CRs (i.e., 
a larger signal frequency range), a high sampling rate is needed, leading to excessive memory 
requirements and high energy cost. This motivates the development of sub-Nyquist technologies 
(using sampling rates lower than the Nyquist rate) for reducing the operational sampling rate 
while retaining the spectral information 0, iflOll . 

The compressive sensing (CS) theory was first introduced to implement the sub-Nyquist 
spectrum sensing in CR networks in ifTTIl . This technique used a number of samples closer to the 
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information rate and reconstructed the wideband spectrum using these partial measurements. Note 
that using CS techniques, the wideband signal to be sampled is required to be sparse in a suitable 
basis Ifl2ll . Ifl3ll : this requirement can typically be met in CR networks due to the low spectral 
occupancy 0. Several sub-Nyquist wideband spectrum sensing algorithms were proposed to 
mitigate the effects of multipath fading in cooperative CR networks in liTHl - lUTlI . After sub- 
Nyquist sampling, the wideband signal can be recovered from these sub-Nyquist samples by 
using one of several possible recovery algorithms, e.g., orthogonal matching pursuit (OMP) Ifl8ll . 
lH9ll or compressive sampling matching pursuit (CoSaMP) Poll . PH . Given a known sparsity 
level such as k, an appropriate number of measurements (samples) M = C 0 klog(N/k) can be 
chosen such that the quality of recovery can be secured, where C 0 denotes a constant and N 
denotes the number of measurements if the Nyquist rate is utilized. Consequently, such CS- 
based algorithms can take advantage of using sub-Nyquist sampling rates for signal acquisition, 
instead of the Nyquist rate, leading to reduced energy consumption, complexity, and memory 
requirements. 

It is worthwhile to emphasize that directly applying CS theory to CR networks may lose its 
inherent advantages in practice. This is because to guarantee a high successful recovery rate, 
CS approaches tend to pessimistically choose the number of measurements M larger than that 
is necessary: For example, as depicted in Fig. [H when k = 10, M = 33%N can be used for 
guaranteeing a very high successful recovery rate; but this is not always necessary because by 
using fewer measurements we may still recover the spectrum with an appropriate or predefined 
probability. Most importantly, the number of measurements M is always linked to the spectrum 
sparsity level k, which means the knowledge of k will be required for determining an appropriate 
value of M in CR networks. However, the sparsity level of the radio spectrum is often unknown 
due to either the dynamic activities of PUs or the time-varying fading channels between PUs and 
CRs 0. Because of this sparsity level uncertainty in practical CR networks, most CS approaches 
intend to further increase measurements to ensure a high successful recovery rate, thereby leading 
to more unnecessary energy consumption. For example, in Fig. [H for the uncertainty range 
10 < k < 20, M = 50 %N (rather than M = 33 %N) will be selected, which does not fully 
exploit the inherent advantages of using CS techniques for implementing wideband spectrum 
sensing in CR networks. 

Against the aforementioned background, this paper aims to bridge the gap between CS theory 
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and practical spectrum sensing. In particular, the novel contributions of this paper can be 
summarized as follows: 

• An autonomous compressive spectrum sensing (ACSS) framework is proposed for recover¬ 
ing the wideband spectrum by using an appropriate number of compressive measurements. 
This framework does not require prior knowledge of the instantaneous spectral sparsity 
level, resulting in reduced system complexity. Performance analysis is given to show that 
the proposed ACSS framework can inherently avoid excessive or insufficient numbers of 
compressive measurements, and help improve CR system throughputs. 

• A novel validation approach is proposed to accurately estimate the actual spectral recovery 
error with high confidence by using only a small amount of testing data. Note that the 
actual spectral recovery error is typically unknown as the actual wideband spectrum is not 
accessible under sub-Nyquist rate. This validation approach applied in the ACSS framework 
enables compressive measurement acquisition halted at an earliest appropriate timeQ. 

• To extend the use of ACSS to noisy measurement environments, another validation method 
is proposed. Theoretical analysis shows that, if a good spectral estimate exists, the proposed 
validation method can find it with a very high probability by using a small testing subset. 

• A sparsity-aware spectral recovery algorithm is designed for spectral recovery without 
requiring knowledge of the instantaneous spectral sparsity level. Iterations of the recovery 
algorithm are analyzed and shown to be able to terminate at the correct iteration index, 
which therefore reduces the possibilities of under-/over-fitting. 

The rest of the paper is organized as follows. Section HD introduces compressive spectrum 
sensing problems and the system model. Section HID presents the ACSS framework and analyzes 
its halting criterion. ACSS is then applied and analyzed in noisy environments in Section HYl and 
the sparsity-aware recovery algorithm is proposed in Section |Vj Simulation results are presented 
in Section ED with conclusions in Section IVII1 We note that, throughout this paper, letters with 
horizontal arrows above them are used to represent vectors, e.g., x and X where the lowercase 
letter denotes the time-domain and the uppercase letter denotes the Fourier domain. Uppercase 
boldface letters are used to denote matrices, e.g., T>. And an N x jV discrete Fourier transform 

'Please note that Bayesian compressive sensing |22l, 1231 can also simultaneously perform reconstruction and validation, and 
determine the confidence level of estimation results. 
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Fig. 1. In a traditional CS system, the successful recovery rate varies when the number of measurements and the sparsity level 
vary 0 In simulations, we assumed N = 200 and varied the number of measurements M from 20 to 180 in eight equal-length 
steps. Additionally, we chose the sparsity level k G [1, M] and adopted Gaussian measurement matrices. After 5000 trials of 
each parameter setting, we obtained this figure. 
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(DFT) matrix is denoted by Fjv, where F A? ' denotes the inverse of the matrix Fa?. 

II. System Model and Problem Statement 

Consider that a CR node receives an analog signal x(t) from PUs, which has the frequency 
range 0 ~ W Hz. Based on the Nyquist sampling theory, such an analog signal should be 
sampled at the sampling rate / > 2W Hz. After a small time step r (seconds) of Nyquist 
sampling, we will obtain a full signal vector x E C ,¥xl , where N = fr (an integer number by 
properly choosing the sampling rate) denotes the number of samples. 

CS theory indicates that a sparse signal can be acquired by using a sub-Nyquist sampling rate 
f s ( f s < 2W), which results in fewer samples than predicted on the basis of Nyquist sampling 
theory. The value of f s is determined by the potential under-sampling fraction multiplying /. 
Since the spectrum is often sparse in CR networks due to the low spectral occupancy ifTTIl . CS 
theory has been applied for signal acquisition at CRs lfl4ll . Ifl5ll . If24ll . Here, the use of a sub- 
Nyquist sampler, such as the random demodulator Il25ll . will generate a compressive measurement 
vector y G C Mxl (M = f s r < N ). Mathematically, the compressive measurement vector y can 
be written as y = T» r, where x denotes the signal vector if the Nyquist rate is employed, 
and denotes an M x N measurement matrix that can be implemented using a sub-Nyquist 
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sampler. If the signal x is /.--sparse (k < M < N) in some basis and the measurement matrix 
is appropriate, we can recover x from y using recovery algorithms. This actually means that, 
using CS theory, we can obtain x by merely using the sub-Nyquist sampling rate f s , instead of 
the Nyquist sampling rate /. 

The basic structure of CS-based spectrum sensing (also called compressive spectrum sensing) 
used in this paper is shown in Fig. 13 The aim is to recover x and its DFT spectrum X = Fjyx 
from compressive measurements y, and then perform spectrum sensing using the recovered 
signal x or its DFT spectrum X. For an overview of state-of-the-art compressive spectrum 
sensing techniques, the reader is referred to @. Spectral domain energy detection ll26ll is a 
typical spectrum sensing approach, and thus is adopted in this paper. As shown in Fig. [2l using 
this approach, we can extract the recovered spectrum within the frequency range of interest (e.g., 
A/) and calculate its signal energy. A detection threshold (denoted by A) is then chosen and 
compared with the signal energy to decide whether this frequency band is occupied or not, i.e., 
choosing between binary hypotheses (occupied) and A/ 0 (not occupied). 



Compressive Spectrum Sensing 

Fig. 2. Diagram of compressive spectrum sensing: The spectral domain energy detection approach is used for spectrum sensing. 


According to the structure of compressive spectrum sensing, we know that the recovery quality 
will have significant impact on the performance of compressive spectrum sensing. The recovery 
quality depends on the following factors: the sparsity level, the choice of measurement matrix, the 
recovery algorithm, and the number of compressive measurements. The sparsity level of spectrum 
in CR networks is mainly determined by the PUs’ activities within a frequency range and the 
medium access control (MAC) of the CRs. To evaluate the suitability of a chosen measurement 
matrix, we adopt an elegant metric: the restricted isometry property (RIP) iTTOll . In ll25ll and If27l 
sub-Nyquist samplers with controllable measurement matrices have been proposed to realize CS. 
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Using such samplers, the primary signal received at CRs is first modulated by pseudo-random 
sequences (which are determined by pseudo-random seeds), and then sampled by standard 
low-rate samplers. Since these pseudo-random sequences are known and controllable, we can 
easily construct known measurement matrices subject to satisfactory RIP. For a comprehensive 
understanding of RIP and measurement matrix design, the reader is referred to ll28ll . Il29ll and 
Poll . OH . respectively. In the rest of this paper, we will thus focus on discussing the following 
two factors: the number of measurements and the recovery algorithm. 

III. Autonomous Compressive Spectrum Sensing (ACSS) 

In this section, we will propose the ACSS framework enabling us to gradually acquire com¬ 
pressive measurements using the sub-Nyquist sampling rate, recover the DFT spectrum, and 
halt the compressive measurements at the correct time. The halting criterion and performance 
analysis will be provided to show that ACSS can avoid excessive or insufficient numbers of 
compressive measurements. 

A. Model and Framework of ACSS 

Consider that CR networks utilize a periodic spectrum sensing structure and each time frame 
has a fixed length L (seconds) which consists of a spectrum sensing time slot and a data 
transmission time slot, as depicted in Fig. |3l The spectrum sensing duration T (0 < T < L) is 
adjustable and equals p (a positive integer) times as long as the small time step r, i.e., T = pr. To 
guarantee the bit rate at CRs, at least T min (seconds) should be reserved for data transmission; 
thus, the spectrum sensing duration T will satisfy L — T = L — pr > T min , equivalently, 
p < CCLmin_ Here, we assume that the spectrum sensing duration T is smaller than the channel 
coherence time, such that the magnitude of the channel response remains constant within T. In 
addition, we assume that, within T, the primary signals are wide-sense stationarity and all CRs 
can keep quiet as enforced by protocols (e.g., at the MAC layer Q). This means that the spectral 
components of the DFT spectrum X = F N x arise only from PUs and background noise. Due 
to the low spectral occupancy in CR networks lUTll . the DFT spectrum X can be assumed to be 
fc-sparse, which means the spectrum consists only of the k largest values that cannot be ignored. 
This sparsity level k is typically unknown but has a known upper bound A’ rnax . This is because, in 
practice, the instantaneous spectral occupancy may be difficult to obtain, but the maximal spectral 


February 24, 2015 


DRAFT 



occupancy can be easily estimated by long-term spectral usage measurements. For example, the 
maximal spectral occupancy within 30 MHz - 3 GHz in New York City has been reported to be 
only 13.1% HU. In such a scenario, k max can be calculated by k max = 13.1% x N. 



■ » • 


Fig. 3. Frame infrastructure of periodic spectrum sensing in cognitive radio networks. 


Using ACSS, we perform compressive measurements using the sub-Nyquist sampling rate 
f s (f s < 2W). The same sub-Nyquist sampler is adopted throughout the spectrum sensing 
duration T, and the corresponding measurement matrices follow the same distribution, e.g., the 
standard normal distribution, or the Bernoulli distribution!] with equal probability on ±1 ll9l . 
IITOll . Furthermore, the set of compressive samples within T is denoted by y p (y p G C MpXl ), 
where M p = f s T = f s pr is the number of compressive measurements. The set of compressive 
samples y p is then divided into two subsets including the training subset R p (R p e O xl ) to 


recover the spectrum, and the testing subset 


Yp (V p G O xl ) to validate the recovered spectrum, 


where M p = r p + v p and there is a trade-offl between v p and r p . Based on CS theory, the two 


subsets can be expressed as 


7? = <r> f = <T> F'bX 


( 1 ) 


and 


V = \p f = tp F~trX 

Vp ^ pJ^p p±. ppj-YYp, 


( 2 ) 


respectively, where T» p is an r p x pN measurement matrix, x p G C pNxl denotes the signal 


2 It has been proved in 0 and Eo) that, if the number of measurements is appropriate, the measurement matrix with either 
Gaussian or Bernoulli distribution can secure the RIP condition with an overwhelming probability. 

3 Given a fixed value of M p , a larger value of v p could result in higher probability of finding the best spectral approximation; 
while on the other hand, it leads to worse spectral recovery since r p = M p — v p becomes less. 
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vector if the Nyquist sampling rate is used within T, X p denotes the DFT spectrum of x p such 
that X p = F pN x p , and ^f p is a v p x pN testing matrix. Using the OMP recovery algorithm 
in Ifl9ll . we could obtain a spectral estimate X p from R p . When we adjust the spectrum 
sensing duration T = pr step by step (via increasing p), a sequence of spectral estimates, i.e., 
Xi, X 2 , • • • , X p , will be obtained. The compressive sampling will be halted once a satisfactory 
spectral estimate is found that meets the halting criterion, or the satisfactory spectral estimate 
cannot be found within the given time. 

The work flow of ACSS is shown in Table U The halting criterion will be analyzed in Section 
IIII-B1 We emphasize that unlike traditional CS approaches, the proposed ACSS divides the 
spectrum sensing duration into several mini time slots, performs compressive sampling step 
by step, and halts the sampling at an earliest appropriate time (once an appropriate spectral 
estimate is found). In this case, some spectrum sensing time slots can be saved and then used 
for data transmission, which will not only improve the CR system throughput (by using longer 
transmission time) but also save energy used for spectrum sensing. Furthermore, unlike other 
CS approaches, the proposed ACSS does not require the knowledge of the spectral sparsity level 
because of the introduction of a validation procedure, where the compressive samples obtained 
during one time step are divided into two subsets and a small testing subset is used for validation. 
The proposed halting criterion enables the sampling to be terminated at the earliest appropriate 
time while guaranteeing wideband spectrum recovery with a small predictable recovery error. 

B. Halting Criterion and Performance Analysis 

As shown in Table ID the halting criterion plays a crucial role in determining the performance 
of the ACSS framework. To improve the energy efficiency of CRs, we hope that the compressive 
sampling can be halted at the earliest appropriate time such that the current spectral estimate 
X p is a good estimate to X p (i.e., the spectral recovery error 11 X p — X p \| 2 is sufficiently small). 
However, the spectral recovery error \\X P — X p \\ 2 is typically not known because the real DFT 
X p is unknown under the sub-Nyquist sampling rate. Thus, using traditional CS approaches, we 
do not know when we should halt the compressive sampling. To solve this problem, we define 

4 The size of the testing subset v p is given as an input, which is chosen according to the following Lemma 1 in the noiseless 
case or Theorem 2 in the noisy case. We then have the size of the training subset r p = M p — v p . 
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Inputs 

Frame length L , minimum data transmission duration 
T m i n , sampling rate f s , time step r, size of testing 
measurements v p , recovery error threshold w, 
confidence factor 77 , energy detection threshold A. 

1. Initialize the time step index p = 1. 

2. Repeat 

a) , perform compressive sampling using f s , obtaining 

the measurement set y p \ 

b) . partition y p into the training subset R p and the 

testing subset V p 4 ; 

c) . use a spectral recovery algorithm to estimate the 

spectrum from R p , and obtain the spectral estimate 


3. 

4. 

X 


d). calculate and update the validation parameter 


\\V v -* v F “AX 


pN^P II 1 . 


Pp — 

e). update the time step index p = p + 1. 


Until the halting criterion p p < w(l — is 

true, or p > L ~'Rnin 

Stop sub-Nyquist compressive sampling. 

If the halting criterion is true, 


Hi 

a) perform energy detection ||2f p || 2 ^ A; 

Ho 


b) for Ho, transmit data via un-occupied bands. 

for Hi, return and report the spectrum is occupied. 
Else 

Increase f s and wait for next spectrum sensing frame. 

End 


the validation parameter p p to serve as a proxy for the actual recovery error: 

a \\V P ~ 

Pp = ---, (3) 

v p 

In the following lemma, we give a result on the relationship between the validation parameter 
p p and the actual spectral recovery error \\X P — X p \\ 2 : 
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Lemma m. For a given confidence factor rj G (0, |), £ G (0,1), v p = Cr/ 2 log| where 

/ tttjN / tttjN 

V 2 Pp V ~^2~Pp 


C denotes a positive constant, the confidence interval 
estimate of the unknown parameter \\X P — X p \\ 2 such that 


1+T] 


1 - 7 ] 


can act as a good 


Pr 


wpN_ 

2 'P 

1 + 7/ 


— II x p — x p \\ 2 < 


wpN_ 

2 'P 

l — rj 


>i-e, 


(4) 


where the minimum confidence level 1 — £ can also be written as 1 — 4exp(—^-) when v v is 
given. 

See Appendix A for the proof of Lemma 1. 

Remarking 1: We see that the actual spectral recovery error 11 X p — X p 11 2 can be directly linked 
to the validation parameter p p in ©. Even though the actual spectral recovery error || X p — X 1 


is not known, we can predict that it lies in a known confidence interval 


/ TTJjN 

' 2 #P 

1+7] 5 


/ Wfp 

1 ~v 


p 11 2 


with 


a confidence level higher than 1 — 4exp(—The confidence factor // determines the width 
of the confidence interval how uncertain we know about the unknown spectral recovery. For a 
given rj, increasing the value of v p (i.e., using more measurements for validation) will help to 
improve the confidence level. Additionally, we note that the choice of the parameter C depends 
on the concentration property of random variables in the matrix ll32ll . Given a good \F, e.g., 
the testing matrix with random variables following either the Gaussian or Bernoulli distribution 
as used in this paper, C can be a small positive constant. The benefit of the proposed algorithm 
will change with different testing matrices: This is because, given the confidence factor 7 / and 
the size of the testing set v p , different testing matrices will lead to different values of C, and 
thus result in different confidence levels. 

Theorem 1: Using the proposed ACSS, for a given confidence factor // G (0. \) and spectral 

is met, we can find a 


npN 


VpV 


recovery error threshold w, if the halting criterion p p < zu(l — rj) 

good spectral estimate such that ||X P — X p \\ 2 < w with a probability higher than 1—4exp( c 

See Appendix B for the proof of Theorem 1. 

Remark \ 1H\2: We can see that, using ACSS, the probability of finding a good spectral 


5 In CS, an estimate x can be obtained by using an ti or mixed G/G-based recovery algorithm. However, the similar- 
ity/difference between x and the actual signal x is not known because the actual signal cannot be directly obtained under the 
sub-Nyquist rate. This lemma aims to find how far x is from x (equivalently X from X ) by considering the ^2 metric \\x — x\\ 2 , 
in order to halt compressive sampling for saving energy at CRs. 


February 24, 2015 


DRAFT 





















12 


estimate exponentially grows as v p increases, i.e., as more compressive measurements are used 
for validation. Once the halting criterion has been met, the compressive sampling will be 
immediately halted as shown in Table HI Furthermore, we note that Theorem 1 can be reshaped 
when the minimum confidence level is given. That is, to find a good spectral estimate such that 
\\X P — X p \\ 2 < w with a confidence level higher than 1 — £, we use the halting criterion 

<5) 

From the relationship between the halting criterion and the ACSS performance as given in 
Theorem 1, we can see that this ACSS framework can decrease the probabilities of excessive or 
insufficient numbers of compressive measurements. 

IV. ACSS in Noisy Environments 

When performing compressive spectrum sensing, there may exist measurement noise due to 
the quantization error of analog-to-digital converters or the imperfect design of sub-Nyquist 
samplers. In this section, we extend the use of ACSS to such noisy environments, and will 
analyze the validation approach to fit the proposed framework. 

Given the noisy compressive measurements, the training subset R p and the testing subset V p 
can be written as 

Rp QpFpjyXp + tiji, (6) 

and 

V p = ^ p F p ^X p + n V) (7) 

respectively, where hr and ny denote the measurement noise introduced during the compressive 
measurement (e.g. generated by signal quantization). Without loss of generality, we model both 
ur and fty as circular complex additive white Gaussian noise (AWGN) with their components 
obeying a distribution CM { 0, d 2 ). 

We expect that compressive sampling can be halted if the current spectral estimate X p is 

very close to the actual spectrum X p . To find this good spectral estimate, we adopt the halting 

criterion \p p — < 0 due to the following: 

Theorem 2: Using ACSS in noisy environments, for any accuracy parameter 6 > 0, 5 > 
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0, q G (0,1), and v p = In (| 


- ~^2 ; to find a good spectral estimate such that X p is 


sufficiently close to the actual spectrum X p , the halting criterion satisfies 


Pr 


\Pp ~ \l 2 S \ < e 


> 1 ~ 6 , 


( 8 ) 


Vpff \ 


where the minimum probability 1 — g can also be written asl — g = 1 — 2 exp j ■ 

The proof of Theorem 2 is given in Appendix C. 

Remark 170 7: Theorem 2 addresses the issue of finding a good approximation of X p in the 
noisy case by using the halting criterion \p p — | < 0. The accuracy parameter 6 in Theorem 

2 has a known relationship with the parameters v p , S, and g. Given a fixed confidence level 1 — g, 
there is a trade-off between 6 and the size of the testing set v p : at the expense of accuracy (i.e., 
a large value of 9), v p can be small. Additionally, we find that the probability of \p p — yf^5 1 < 9 
rapidly increases as v p increases. That is, using more measurements for validation, we have a 
higher probability of finding the good spectral estimate. 

Taking advantage of Theorem 2, we extend the use of ACSS (based on Table I) to noisy 
environments. The inputs in Table I will be adjusted to ‘frame length L, minimum data trans¬ 
mission duration T min , sampling rate f s , time step t, size of testing measurements v p , accuracy 
parameter 9, noise variance S, and energy detection threshold A.’ The whole work flow of ACSS 
in noisy environments remains the same as in Table I except that the halting criterion is changed 
1° I Pp — v^l < 9. Using the proposed ACSS under the condition that the spectral sparsity level 
is unknown and the effects of measurements noise are not negligible, compressive sampling 
can still be halted in the correct time and the problems of excessive or insufficient numbers of 
measurements can be avoided. 


V. Sparsity-Aware Spectral Recovery (SASR) Algorithm 

Traditionally, greedy recovery algorithms, e.g., OMP, will iteratively generate a sequence of 
estimates X p , X p , • ■ • . X* which can lead to a good spectral estimate under certain system 
parameter choices. Using t = k iterations in OMP, we can obtain a A-sparse vector X p as an 
estimate of the actual spectrum X p [18]. That is, the sparsity level k is required to be an input 
for OMP, and this input is usually required in most other greedy recovery algorithms. However, 
in CR systems, the spectral sparsity level k is often unknown or difficult to estimate, which 
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can result in early or late termination of that traditional greedy algorithms (i.e. underfitting and 
overfitting problems). On the other hand, we note that the proposed Theorem 1 and Theorem 
2 are used to identify a satisfactory spectral approximation of the actual spectrum from an 
estimate sequence by using appropriate halting criteria. The theorems thus can be applied in 
recovery algorithms to solve the underfitting and overfitting problems: The halting criteria can 
help terminate the iterations at an appropriate time without requiring the knowledge of k, and an 
estimate of the spectrum (i.e. the recovered spectrum) will be obtained. To this end, we propose 
a so-called sparsity-aware spectral recovery (SASR) algorithm to handle the spectrum recovery 
problem given unknown instantaneous spectral sparsity level k, as shown in Table HU 

Using recovery algorithms, we aim to obtain an estimate of x p or its spectrum X p from R p . 
Since x p is /c-sparse (i.e. x p has k non-zero components), the vector R p = <& p x p is a linear 
combination of k columns from We thus need to identify which column of is involved 
in R p , by choosing the column of <F ; , that is mostly correlated to the residual of R p at each 
iteration. As shown in Table HU using the proposed SASR algorithm, we find the support index 
( p t that can maximize the correlation between the remaining part of R p and the measurement 
matrix at each iteration. A new support index set OJ is then formed by merging the previously 
computed support index set with the current support index (p *. In the step 2 -d) of Table ITU we note 
that <F p (fT) denotes a sub-matrix of <E» P that is obtained by selecting only those columns whose 
indices are within OJ and setting the remaining columns to zeros. We use the Moore-Penrose 
pseudoinverse to solve the least squares problem, and then obtain a new spectral estimate X p . 
To verify whether X p is a good spectral estimate, we calculate the parameter p p using the testing 
subset V p and the spectral estimate X p . After that, the residual p p is updated and the algorithm 
iterates on the residual. Finally, the spectral estimate that breaks the loop of step 2 is returned 
as the output of SASR algorithm. 

In the SASR algorithm, the halting criterion can be adjusted when different inputs are given. 
For example, for noisy measurements, if the key parameter 6 is of interest, we could set up 6 
by using an expected minimum confidence level 1 — g: 


9 = 



(9) 


4 v p 
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Sparsity-Aware Spectral Recovery (SASR) Algorithm 


15 


Inputs: 

Training subset R p , testing subset V p , testing matrix 
V p , measurement matrix $> p , recovery error threshold 
w (noiseless case), confidence factor 77 (noiseless case), 
noise variance S 2 (noisy case), accuracy parameter 0 
(noisy case), max sparsity & max . 

1. Initialize: t = 0, U° = 0, 7 ° = R p , and p ° = 0. 

2. While |p* - y/Z5\ > 0 and t < & max , do 

a) . Update the iteration index t = t + 1. 

b) . Identify the support index 

V* = arg max ie[1)PJV ] | < > |. 

c) . Update the support index set U {<p*}. 

d) . Solve the following least squares problem and 

obtain a new spectral estimate: 
xp = ar s min x p II Rp ~ ®p( nt ) F pNX P h. 

e) . Calculate the validation parameter 

n t = ll?P-ttpF^*«||i 
^P v P 

f) . Update the residual 7 * = R p — ^ P F~^X^. 

3. Return the spectral estimate: X p = X*. 

Halting Criterion: 

{ Pp < ro(l — Tj) For noiseless measurements. 

|/ 9 * — < 9 ,For noisy measurements. 


where [x] + denotes max(x, 0). We can then halt the iteration in the correct iteration index with 
a confidence level greater than 1 — g. The proof of © is similar to the proof of Theorem 2: To 
find the accuracy parameter 9, we use the following quadratic equation regarding 9 from (l24l) : 

v p • 9 2 — Un 6 ■ 9 — (4 — 7r) In S 2 = 0. (10) 

It can be easily determined that the discriminant of the above quadratic equation is positive, and 
we obtain the distinct real root as given by ©. 

We note that one important advantage of the proposed SASR algorithm is that it does not 
require the knowledge of instantaneous spectral sparsity k; Instead, it only requires the sparsity 
upper bound k max which can be easily estimated by long-term spectral usage observations. 
Additionally, traditional greedy algorithms employ the residual ||7p|| 2 smaller than a threshold 
as a halting criterion, where the residual 11 ^ 11 2 decreases or remains as the number of iterations 
increases. An inappropriate threshold in greedy algorithms could lead to either under-fitting or 
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over-fitting. By contrast, using the proposed algorithm, we monitor the validation parameter 
Pp instead of the residual ||7p|| 2 ; We can terminate the iteration in the correct iteration index 
with a high probability which exponentially increases with v p increasing or S decreasing. More 
measurements for validation can significantly reduce the risk of data under-/over-fitting. Further¬ 
more, compared with traditional recovery algorithms, the proposed SASR algorithm reduces the 
number of iterations and thus the complexity. The running time of the proposed SASR algorithm 
is dominated by the step 2-b) as shown in Table II, whose cost is 0(r p pN ) for one iteration. At 
iteration t, the least squares problem can be solved with marginal cost 0(tr p ). As the iteration 
can be terminated at the correct index t = k with a high probability, the total running time of the 
proposed SASR algorithm is thus 0(kr p pN). By contrast, as discussed in [33], the total running 
time of the traditional OMP algorithm is 0(k mm M p pN), as k max iterations are likely needed 
(i.e. an overrun occurs) when the instantaneous spectral sparsity is unknown. The computational 
complexity of the proposed SASR algorithm is thus lower than that of the OMP algorithm. 

VI. Simulation Results 

In our simulations, the wideband analog signal model in ll27ll was adopted; Thus, at a CR the 
wideband signal of interest can be written as 

N b 

x{t) = ^2 \TEiBi • sine (B t (t - a)) • cos (2nfi(t - a )), (11) 

z=i 

where x(t) consists of N b non-overlapping subbands, and £), B t , and fi denote the received 
power, the bandwidth, and the centre frequency of subband l at the CR, respectively. The function 
sinc(.x) denotes the normalized sine function, i.e., sinc(.x) = S11 ^ ,r) , and a denotes a small 
random time offset. The major simulation parameters are listed in Table III unless otherwise 
stated. The overall bandwidth of the signal x(t) is W (Hz). The frequency range of subband l 
is [fi — fi + ^], where fi is randomly located within [-y ~ W — ^]. We note that in our 
simulations, we have the spectral occupancy (Yld=iBi)/W calculated as 0% ~ 8% according 
to the set up in Table III, which is particularly relevant to practical CR networks. The sparsity 
level k thus exists in the range of 0%iV ~ 8 %N; Given a fixed value of k, the selection 
of Bi will be conditional. In addition, during the spectrum sensing duration, we assume the 
signal from primary users and the channel conditions are quasi-stationary. We adopt the sub- 
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Nyquist rate f s (f s < 2 W) for sampling the wideband signal throughout simulations and employ 
compressive measurement matrices with standard normal distribution. Please also note that, the 
size of compressive measurements is closely related to the choice of r because M p = f s pr. 
A smaller r will not provide a satisfactory spectral recovery rate due to insufficient training 
data. On the other hand, a larger r will require more memory space to store the compressive 
measurements. Here, we assume r = 0.2 ps considering both the spectral recovery requirement 
and memory requirement. Using the settings in Table III, instead of N = 2 Wt = 1000 Nyquist 
samples, we have f s r = 200 measurements in each time slot, among which v p measurements 
are used for validation and the residual is used for recovering the spectrum. 

TABLE III 

Simulation Parameters for ACSS 



ACSS System Parameters 

Settings 

Symbol 

Description 

w 

Signal bandwidth of interest 

2.5 GHz 

N b 

Number of subbands 

4 

k 

Spectral sparsity level 

32 

Bi 

Bandwidth of subband l 

0-50 MHz 

fi 

Center frequency of subband l 

Ml ^ ^ 

2 vy 2 

Ei 

J? 

Received SNR of subband l 

7 - 25 dB 

a 

Small random time offset 

0 — 0.1 /is 

L 

Frame length 

4 /is 

T min 

Min data transmission time 

2.4 /is 

T 

Small time step 

0.2 /is 

fs 

Sub-Nyquist sampling rate 

1 GHz 


Firstly, in Fig. |4] we verify the validity and accuracy of the confidence interval shown in 
Lemma 1 using the settings in Table III. Effects of the confidence factor rj and the number 
of testing measurements v p on the confidence level are also demonstrated. The value of C in 
Lemma 1 depends on the concentration property of random normal distributed variables in the 
matrix \P, and without loss of generality we choose C = 1 to obtain a theoretical minimum 
confidence level in this figure. The confidence level shown in Fig. 0| represents how often the 
actual spectral recovery error lies within the confidence interval. We can see that the wider the 
confidence interval we are willing to accept (with using a larger rf), the more certain we can 
be that the actual recovery error would be within that estimated range (i.e., a higher confidence 
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level obtained). It can also be seen that the confidence level improves with v p increasing; That is, 
with more testing data, validation results are more trustworthy. The minimum confidence level 
shown in Fig. |4] indicates a theoretical lower bound of how sure the estimation range can be for 
given settings of r\ and v p . With either rj or v v increasing, the lower bound is more close to the 
simulated confidence level. 



40 50 60 70 80 90 100 

Number of testing measurements v p 

Fig. 4. Confidence level in Lemma 1 and the effects of the confidence factor 77 and the number of testing measurements v p . 


Using the above settings in Table III, in Fig. |5] we present the proposed validation param¬ 
eter the actual recovery error | X p — V p || 2 , and the proposed confidence interval 

/ tttjN / tttjN 

V 2 Pv V ~^2~Pp 
I+77 7 1 —77 


when the number of time steps increases. To make the confidence interval 
narrower and more precise, we consider rj = 0.2, and show the effects of changing the number 
of testing measurements v p by using two sub-figs. We can see that the proposed validation 
parameter is very close to the simulated recovery error regardless the number of time steps or 
the value of v p varying, and can therefore be used to predict the actual recovery error. With p 
increasing, the sensing duration is increased step by step, and the sensing will be halted if the 
recovery error is sufficiently small, for example we need p = 6 in Fig. [5] (a) and P = 3 Fig. |5] 
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(b). It is also illustrated that the more testing measurements, the fewer time slots are required 
to recover the spectrum. The remaining time slots can then be used for data transmission to 
improve system throughput. 
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Fig. 5. The comparison of the proposed validation parameter y 2n ^p P in Lemma 1 and the actual recovery error \\X P — X P \\2 
when the number of time steps increases. The blue bars give the confidence interval in Lemma 1 when the confidence factor 
r] = 0.2. (We consider the number of testing measurements v p = 40 in (a), and v p = 60 in (b).) 


Applying the halting criterion in Theorem 1, we now demonstrate the performance of the 
proposed ACSS compared to a traditional CS system in Fig. 0 when the spectral sparsity 
level k varies. We consider two cases of the sub-Nyquist sampling rate, f s = 750 MHz and 
1GHz respectively, and rj = 0.2. We define the successful spectral recovery as the case with 
the mean squared error not larger than 0.001. It is evident that the proposed ACSS can not 
only automatically adapt the number of measurements to the unknown sparsity level k, but 
also considerably improve the spectral recovery performance compared with the traditional CS 
approach no matter for either value of f s . The lower the spectral level, the higher the successful 
recovery rate obtained. It is also illustrated that a larger number of validation measurements v p 
does not always guarantee a better recovery performance: The two red curves crossover with k 
increasing. It is because that for a fixed set of compressive measurements, a larger value of v p 
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means a smaller training subset used for recovery which may lead to worse spectral recovery 
performance especially for a higher sparsity level. 



Fig. 6. The performance comparison of the proposed ACSS system and the traditional CS system HQ when the spectral 
sparsity level k and the sub-Nyquist sampling rate f s vary. Successful spectral recovery is defined as the spectral recovery with 
the mean squared error not larger than 0.001. 


We now extend the use of ACSS in noisy measurement environments. Fig. |7] shows the 
comparison between the simulated probability of the halting criterion \p p — y5|5| < 0 holding 
true and the theoretical probability lower bound 1 — g in Theorem 2 when the number of testing 
measurements v p and the accuracy parameter 0 vary. To guarantee a high confidence level, we 
consider 6 = 0.65, 0.655, and 0.75. It is shown that the lower bound is very tight and thus can 
be used to predict the actual probability. With a high probability of the halting criterion holding 
true, we can expect that a good estimation of the spectrum is found. Fig. [7] also shows that given 
a fixed confidence level of the halting criterion, at the expense of accuracy (i.e. a larger value 
of 9), we can use fewer testing measurements. In addition, the confidence level exponentially 
increases with v p increasing. That is, using more testing measurements, we have a better chance 
of finding a good spectral estimation. 

Fig. [8] shows the performance comparison of the proposed SASR algorithm and the traditional 
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Simulated probability, 0=0.658 
Simulated probability, 0=0.708 
Simulated probability, 0=0.758 
Theoretical probability lower bound, 0=0.658 
Theoretical probability lower bound, 9=0.708 
Theoretical probability lower bound, 0=0.758 


80 


The Number of Testing Measurements (v 


Fig. 7. The comparison between the simulated probability of the halting criterion \p p —y/^S\ < 6 holding true and the theoretical 
probability lower bound 1 — q in Theorem 2 when the number of testing measurements v p and the accuracy parameter 6 vary. 


OMP algorithm when the actual spectral sparsity level k and the noise variance 5 2 vary in noisy 
environments. We assume the sparsity level k is unknown when performing recovery, but we 
know that k exists in the range of 0%N ~ 8 %N, i.e., k max = 8%N = 80 according to the 
settings in Table III. The received signal-to-measurement-noise (SNR) ratios of these subbands 
are set to be randomly distributed between 7 ~ 25 dB as listed in Table III. We consider S 2 = 1 
and 4, respectively, and v p =40. The recovery mean squared error in the noisy case is defined 
as E {X p i — Xp^y/X 2 ^ where X pi denotes the 7-th component of the vector X p . We can 
see that compared to the traditional OMP algorithm, the proposed SASR provides much better 
spectral estimation and recovery performance, regardless the values of S 2 or k. It is because that 
the OMP algorithm tends to use more number of iterations to avoid under-fitting problems and 
to prevent missed detection leading to harmful interference to PUs in CR networks. However, on 
the other hand, using more number of iterations will cause over-fitting problems and exaggerate 
minor fluctuations in the data which will finally result in poor recovery performance. We would 
like to emphasize that the proposed SASR algorithm will obtain a more significant performance 
improvement in practice, as there always exists a larger uncertainty of k in realistic wideband 
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CR networks. 



Fig. 8. Performance comparison of the proposed SASR algorithm and the traditional OMP algorithm when the spectral 
sparsity level and the noise variance S 2 vary in noisy environments. The recovery mean squared error is defined as 
E (X p ,i — X p ,i ) 2 / Xp ti where X p/l denotes the <-th component of the vector X p . 


VII. Conclusions 

In this paper, we have proposed a novel framework, i.e. ACSS, for compressive spectrum 
sensing in wideband CR networks. ACSS enables a CR to automatically adopt an appropriate 
number of compressive measurements without knowledge of the instantaneous spectral sparsity 
level, while guaranteeing the wideband spectrum recovery with a small predictable recovery 
error. This is realized by the proposed measurement procedure and the validation approach. 
The validation approach can accurately estimate the actual spectral recovery error with high 
confidence by using only a small amount of testing data. The proposed ACSS thus avoids 
excessive or insufficient numbers of compressive measurements, and helps enhance the recovery 
performance and improve the energy efficiency of CR networks. In addition, we extend the use 
of ACSS to noisy environments and propose another validation approach: If a good spectral 
estimate exists, the validation approach will find it with a high probability. Furthermore, we 
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have proposed the SASR algorithm to recover the wideband spectrum without requiring the 
knowledge of the instantaneous spectral sparsity level. The SASR algorithm can autonomously 
adopt a proper number of iterations, and thus solve the under-fitting or over-fitting problems 
which commonly exist in most other greedy recovery algorithms. 

Simulation results have shown that the proposed ACSS framework can correctly stop the 
signal acquisition that saves both spectrum sensing time and signal acquisition energy in both 
noiseless and noisy environments. Compared to traditional CS, ACSS can not only provide better 
spectral recovery performance, but also help improve system throughput and energy efficiency 
of CR networks. In addition, the proposed SASR algorithm can achieve lower recovery mean 
squared error and better spectrum sensing performance compared to the OMP algorithm. We 
emphasize that the ACSS framework is not limited to CR networks; The proposed validation 
approach could be extended to other CS applications, e.g., a CS enabled communication system 
where the approach could be used to terminate signal detection at an appropriate time. Since RF 
spectrum is essential to wireless communications and the wideband techniques could potentially 
provide higher capacity, the proposed framework in this paper is thus particularly valuable and 
can have a wide range of applications, e.g., in broadband spectral analyzers and ultra wideband 
radars. 


Appendix A 
Proof of Lemma 1 


The Johnson-Lindenstrauss Lemma lf33ll states that a set of N points in a high-dimensional 
Euclidean space can be mapped (with low distortion) into a Euclidean space of much lower 
dimension v p , and all distance are preserved up to a multiplicative confidence factor between 
1 — rj and 1 + r). With the aid of the Johnson-Lindenstrauss Lemma in Theorem 5.1 of Il33ll . we 
get v p = Ci)- 1 log | where C' denotes a positive constant, and 


Pr 


(1 — ^ 7 ) 11^112 < 


|^ p f| 


a 


7VV n 


< (l+r))\\x\\ 2 


>!-<£• 


( 12 ) 


Replacing x in (IT2l) by F~^(X P — X p ), we have the inequality (fl3l) . Jointly using © and ©, 
we simplify (fT3l) to (fl4l) . Applying Parseval’s relation to (fl4l) . we then get (fl5l) . The equations 
(I13H15I) are shown on the top of the next page. And finally, we obtain 
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Pr 


(i-,)I|f^(x p -x p )|| 2 < 


|*„FrifJSf_— -X_)Hi , - - 

pN ' -<(i+k)I|f^(x p -x p )|| 2 


\/V 


Pr 


Pr 


7T V p 

7 r 
2‘ 

(i - n)\\x p - x p \\ 2 < x l^ Pp < (i + v )\\x p - x p \\ 2 


(1- V )\\F^(X p -X p )\\ 2 < J- Pp < (1+ V )\\F^(X p -X p )\\ 2 


> 1-6 (13) 
> 1-6 ( 14 ) 

> 1-6 ( 15 ) 


Pr 


This completes the proof. 


npN_ npN_ 

2 Pr <\\X P -X rh <^ 2 


1 + f] 


1 — 7] 


> 1-6 


(16) 


Appendix B 
Proof of Theorem 1 

Using the probabilistic inequality Pr (B) > Pr(AflS), we can obtain the following inequality: 


Pr 

Pr 


\\X P -X P \\ 2 <^ 




> 


/ 7TpN , . / 7TpN 

V 2 Pp \W _ y IL ^ V 2 Pp 


1 +r) 


< \\x r - x r h < 


(17) 


J TTpN 

If the halting criterion p p < w{l — p)J is met, we have v ^ - < w, then the following 
inequality holds: 


Pr 




||Xp ||2 ^ 'CXJ 

> Pr 


ny>-y>ii2< 


npN_ 

2 ‘P 

1 — T] 


(18) 


With the aid of Lemma 1, jointly using ©, (fT7h . and (fl8h . we have 


Pr 


\\X P - X p \\ 2 < zu > 1 - £ = 1 - 4exp(-^-). 


(19) 


This completes the proof. 
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Appendix C 
Proof of Theorem 2 

Suppose that X p is a good spectral estimate such that X p = X p , we can write the validation 
parameter by using © and © 

\\% - _ \\nv\\i _ EZi \<\ 


Pp 


Vp Vp Vp 


( 20 ) 


Define a new variable Di = \n l v \ — \J\6. Since the measurement noise n l v ~ CA/"(0, 6 2 ), we 
have Di following the Rayleigh distribution with zero mean and variance Additionally, 

we can find that \Di\ < 3(5 with 99.7% confidence (according to the three-sigma rule) which is 
like being almost sure. Using the Bernstein’s inequality [34], we obtain the following inequality: 


Pr [|E£i D,\ > C] = Pr [lElii l"’vl - «p\/P| > C] 

C 2 /2 


< 2 exp — 


= 2 exp ( — 


E^E[A 2 ]+niax(|A|)C/3 

c 2 


( 21 ) 


Letting ( = v p 6 and using 


(4 — 7r )v p 8 2 + 2 (,8 J 

we can rewrite the above inequality 


Pr 




> e 


< 2 exp 


v p 9' 2 


(4 — 7 t)S 2 + 266 


( 22 ) 


Equivalently, (l22l) can be written as 


Pr 


Pp 


< 6 


>1 — 2 exp 


v P 6 2 


(4 - 7 t)6 2 + 2 96 

Aligning the right item of (l23h with the lower bound 1 — g, after manipulation we obtain 

'2\ (4 — tt)6 2 + 266 


(23) 


v p = In 


6 2 


(24) 


This completes the proof of Theorem 2. 
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